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Abstract. Diverse analysis approaches have been proposed to distin- 
guish data missing due to death from nonresponse, and to summarize 
trajectories of longitudinal data truncated by death. We demonstrate 
how these analysis approaches arise from factorizations of the distri- 
bution of longitudinal data and survival information. Models are illus- 
trated using cognitive functioning data for older adults. For uncondi- 
tional models, deaths do not occur, deaths are independent of the longi- 
tudinal response, or the unconditional longitudinal response is averaged 
over the survival distribution. Unconditional models, such as random 
effects models fit to unbalanced data, may implicitly impute data be- 
yond the time of death. Fully conditional models stratify the longitu- 
dinal response trajectory by time of death. Fully conditional models 
are effective for describing individual trajectories, in terms of either 
aging (age, or years from baseline) or dying (years from death). Causal 
models (principal stratification) as currently applied are fully condi- 
tional models, since group differences at one timepoint are described 
for a cohort that will survive past a later timepoint. Partly conditional 
models summarize the longitudinal response in the dynamic cohort of 
survivors. Partly conditional models are serial cross-sectional snapshots 
of the response, reflecting the average response in survivors at a given 
timepoint rather than individual trajectories. Joint models of survival 
and longitudinal response describe the evolving health status of the en- 
tire cohort. Researchers using longitudinal data should consider which 
method of accommodating deaths is consistent with research aims, and 
use analysis methods accordingly. 
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1. INTRODUCTION 

Research studies often collect information at mul- 
tiple timepoints. For example, the Cardiovascular 
Health Study (CHS), an observational study of 5888 
older adults, conducted annual assessments of car- 
diovascular functioning and other health measures 
for up to 18 years. With these longitudinal mea- 
surements, CHS data have been used to study dis- 
ease course (Kaplan et al., 2005), health in the years 
leading up to a diagnosis (Diehr et al., 2001b) and 
the natural history of aging (Burke et al., 2001; 
Diehr et al., 2002). 

Missing data can be an impediment to interpret- 
ing longitudinal data. Suppose a cohort of 200 sub- 
jects report self-rated health at age 70 years, and 
only 150 of these subjects are located for follow-up. 
If the average self-rated health at age 75 is higher 
than the average at 70, the increase could reflect im- 
provement in individuals' health, attrition of sicker 
participants or death of sicker participants. 

Many analysis methods for longitudinal data with 
dropout and nonresponse have been proposed to ad- 
dress different research aims, study designs, missing 
data patterns and estimation strategies (Little and 
Rubin, 1987; Rubin, 1987; Robins, Rotnitzky and 
Zhao, 1995). Comparatively little work in statisti- 
cal methodology has addressed data missing when 
deaths occur during the period of follow-up, and re- 
cent work has focused mostly on principal stratifica- 
tion (Frangakis and Rubin, 2002; Frangakis et al., 
2007). Kurland and Heagerty (2005) characterized 
targets of inference for longitudinal data truncated 
by death by factorizations of the joint distribution of 
survival and longitudinal response, f{S,Y). The fac- 
torization was introduced primarily to provide con- 
text for a single approach, the partly conditional 
mean model. In this article, we explore several tar- 
gets of inference in detail, and give guidance on ap- 
propriate analysis techniques for common scientific 
questions arising for longitudinal data truncated by 
death. We present six modeling options, illustrated 
using both hypothetical and actual CHS data. The 
hypothetical data without measurement error illus- 
trate clearly how modeling choices for longitudinal 
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data reflect assumptions about survival. The real 
data illustrate how standard analysis techniques such 
as random effects models and generalized estimating 
equations (GEE) may be applied to address different 
research aims involving longitudinal data truncated 
by death. Bias, estimation and efficiency are impor- 
tant issues for data analysis. However, we focus only 
on our primary interest, the interpretation of regres- 
sion model estimands. Although each model is used 
to fit a slope and expected response values, these 
estimands for the longitudinal response are "apples 
and oranges," not directly comparable due to differ- 
ent factorizations of longitudinal response and sur- 
vival. 

2. BACKGROUND: FOLLOW-UP CENSORED 
BY NONRESPONSE (DROPOUT) 

A brief review of models for longitudinal data with 
monotone dropout provides a foundation for dis- 
cussing analysis of longitudinal data truncated by 
death. Two common, widely applied analysis meth- 
ods for longitudinal data are random effects mod- 
els (Laird and Ware, 1982) and generalized estimat- 
ing equations (GEE) (Liang and Zeger, 1986). By 
modeling a structure for the correlation between 
subjects' longitudinal responses, a correctly speci- 
fied random effects model will yield consistent, un- 
biased estimates of regression parameters by max- 
imum likelihood estimation, even with unbalanced 
data (Laird, 1988). For example, if sicker partici- 
pants drop out, their trajectory of decline in self- 
rated health is continued implicitly by a well-specified 
random effects model. If trends for dropouts can be 
inferred from observed data and parameters for lon- 
gitudinal response and dropout are distinct (miss- 
ing at random, such as when scores decline before 
dropout), the missingness is ignorable and the over- 
all rate of change may be analyzed as if no one has 
dropped out. If the decline in health that leads to 
dropout starts after the last recorded measurement, 
then dropout is nonignorable, and random effects 
models are not an easy solution. Untestable assump- 
tions must be made about nonignorable dropout 
processes to model longitudinal trends (Laird, 1988). 
GEE can accommodate data missing at random if 
estimating equations are weighted by the inverse 
probability of dropout (Robins, Rotnitzky and Zhao, 
1995). Giving additional weight to observed data 
for people who were likely to drop out is similar to 
implicit or explicit imputation of unobserved data. 
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In fact, under some conditions, weighted GEE and 
imputation will give the same results (Paik, 1997). 
Missing at random is often a reasonable assump- 
tion, especially when longitudinal observations are 
closely spaced relative to mechanisms acting on both 
dropout and response. For example, preclinical cog- 
nitive changes could likely be detected by annual 
assessments before a CHS participant becomes im- 
paired by dementia in a way that would lead to 
nonresponse. However, analysis of longitudinal data 
with MAR dropout still requires accurate modeling 
of the regression model (fixed effects) and either cor- 
relation (for random effects models) or dropout (for 
weighted GEE) (Kurland and Heagerty, 2004). 

3. DATA EXAMPLES AND NOTATION 
3.1 Cardiovascular Health Study (CHS) 

The Cardiovascular Health Study (CHS) was a 
population-based prospective longitudinal study of 
5888 adults aged 65 years and older at baseline 
(Fried et al., 1991). Cognitive functioning was as- 
sessed annually for up to 10 years by the Modi- 
fied Mini-Mental State Examination (3MSE, scored 
from to 100) (Teng and Chui, 1987). Our primary 
goal in the CHS analysis is to describe the trajectory 
of cognitive functioning (3MSE) over time, and to 
estimate 3MSE scores at different ages. Gender ef- 
fects are explored in each analysis to add a between- 
person variable of interest to the longitudinal (within- 
person changes) model, and to explore the effects 
of differential survival on different regression ap- 
proaches for longitudinal and survival data. We ex- 
amine the 3814 participants aged 70 years and older 
at baseline. In this cohort, 1356 participants (36%) 
died during follow-up: 44% (744/1703) of men and 
29% (612/2111) of women. We will examine how 
different analysis methods each yield 3MSE trajec- 
tories (rates of change) and fitted values (expected 
cognitive status at specific ages), but address differ- 
ent research aims. 

Although models may be constructed to accom- 
modate both deaths and nonresponse 
(Kurland and Heagerty, 2005), we imputed data 
missing due to nonresponse for simplicity of presen- 
tation. Data were not considered missing if follow-up 
was truncated by death, or censored by the end of 
the study period. (A later cohort to boost minority 
recruitment received 6 annual assessments instead 
of 10.) Most participants (n = 2061, 54%) completed 
all scheduled 3MSE assessments. About 17% of 3MSE 



Table 1 

Longitudinal 3MSE scores for 4 hypothetical CHS 
participants (X = deceased) 









A? 


;e 






Participant 


70 


71 


72 


73 


74 


75 


A ("normal") 


90 


90 


90 


90 


90 


90 


B ("mild cognitive impairment") 


84 


82 


80 


78 


76 


74 


C ("terminal decline") 


84 


80 


76 


X 


X 


X 


D ("terminal decline") 


65 


50 


35 


X 


X 


X 



scores (5174 of 31093) were missing due to partici- 
pant nonresponse, but most participants with miss- 
ing data had only one or two scores missing (n = 948 
participants). Some participants had dropped out of 
the study, and were missing 7 or more scores each 
(n = 134). Nonresponse was accommodated by sin- 
gle imputation using the Markov Chain Monte Carlo 
method of PROG MI in the SAS/STAT software, 
version 9.1 (SAS Institute, Inc., Gary, NC). Impu- 
tation was stratified by time of death (for decedents) 
and recruitment group (for nondecedents) , and was 
modeled based on observed 3MSE values, baseline 
age, gender and recruitment group. 

Accommodation of deaths in CHS data also will 
be described using simplified hypothetical data. Ta- 
ble 1 shows 3MSE data for 4 hypothetical partici- 
pants with a baseline age of 70. Participant A, rep- 
resenting normal cognitive functioning, has a 3MSE 
of 90 points at all assessments. Participant B's lin- 
ear decline from 84 to 74 points over 5 years reflects 
a possible trajectory of mild cognitive impairment 
(which could be interpreted as preclinical Alzheimer's 
disease) . Participants C and D both decline between 
baseline and age 72, and die before age 73. 

3.2 Notation 

Vector Yj represents the longitudinal response (i.e., 
cognitive functioning or quality of life) , measured at 
multiple timepoints for participant i. The dimen- 
sion of Yj may differ for individuals (values of i), 
due to death. For example, in Table 1 (hypotheti- 
cal CHS data), responses for participant A, Ya, are 
the vector (90,90,90,90,90,90), and for participant 
G, Yc, are (84,80,76). Scalar variable Si represents 
survival time for participant i, such as age at death 
or weeks from baseline until death: in Table 1, Sq 
is 73 years. The dimension of Yj is determined by 
the value of 5^, but is not in principle affected by 
data missing due to nonresponse. In practice, the 
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Summary of statistical models for longitudinal response and survival (time of death) 



Statistical model 



Scimple research setting 



Primary analysis 
method 



Comments 



Unconditional f{Yi) 
Describe Yi (longitudinal re- 
sponse) in setting where sur- 
vival (Si) is unrelated to Yi, 
or when death does not result 
in missing data 

Fully conditional: pattern-mixture f{Yi\Si — s) 



Rate of local recurrence follow- 
ing ablation of liver tumors 



Describe Yi separately for 
groups defined by survival 
time 



Longitudinal change in physi- 
cal functioning following stroke, 
separately for < 1, 1-5, and > 5- 
year survivors 



Mixed effects / ran- 
dom effects / latent 
variable regression 



Mixed effects / ran- 
dom effects regression 
stratified by survival 
time 



Fully conditional: principal stratification f{Yi{z)\Si{0) > s,Si{l) > s) 



Describe average causal treat- 
ment effect for stratum that 
would survive past time s re- 
gardless of treatment 



Average QOL difference for 
toxic treatment with greater 
survival, versus less toxic treat- 
ment with lesser survival, in 
subjects expected to live 6+ 
months on either treatment 



Weighted averaging of 
estimated outcomes 
from models such 
as generalized linear 
models 



May implicitly impute 
data beyond death 



Describes individual tra- 
jectories, but uses future 
survival information to 
predict earlier responses 



Weights are unidentifi- 
able and must be ex- 
plored through strong as- 
sumptions and/or sensi- 
tivity analysis 



Fully conditional: terminal decline f{Yi\Si = s) 

Describe Yi counting back- Terminal decline studies 

ward from time of death 



Mixed effects / ran- 
dom effects regression 



e. Partly conditional: regression conditioning on being alive f(Yi\Si > t) 

Describe Yi in the dynamic co- Average physical functioning in GEE 
hort of survivors at each time- survivors at 6 months and 5 
point years after stroke 



f. Joint model f{Yi\Si) 

Describe both Yi and Si for 
example, "probability of being 
alive and healthy" 



Percent of stroke patients who 
are alive and can perform self- 
care 6 months after stroke 



with indepen- 
dence working correla- 
tion 



Logistic regression, 
GEE (binary out- 
come) ; specialized 
multiple response 
methods 



Time scale is retrospec- 
tive 



Describes longitudinal 
trend of dynamic cohort, 
not individuals 

Continuous longitudinal 
outcomes may need to be 
categorized for analysis 



analysis in Section 4 uses imputation of data miss- 
ing due to nonresponse to ensure that the observed 
response vector is of the proper dimension. The joint 
distribution f{Yi,Si) describes the probabihty that 
Yi takes on a vector of specific values, and that par- 
ticipant i dies at a specific time. We assume that 
(YjjS'j) are independent and identically distributed 
over i. 

4. STATISTICAL MODELS FOR 
LONGITUDINAL RESPONSE AND SURVIVAL 

Regression models for longitudinal data describe 
the relationship between predictors and the longitu- 
dinal response, Yj. Because survival Si determines 



the length of Yj and is not fixed, regression models 
of longitudinal data truncated by death must ex- 
plicitly or implicitly model survival as well. A single 
regression model could be built for the joint dis- 
tribution /(Yj,5i), or for factorizations based on 
the definitions of joint and conditional distributions: 
f(Yi\Si)f{Si) or f{S,\Yi)f{Y,). We wih character- 
ize models for Yj as unconditional, fully conditional 
or partly conditional based on how, or whether, the 
longitudinal response model conditions on Si. These 
models are defined in more detail below, and sum- 
marized in Table 2. Each model is applied to the 
hypothetical (Table 3) and actual (Figure 1) CHS 
data. Models are fit to data for all participants, but 
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Table 3 

Hypothetical CHS data (Table 1) estimated age 15 3MSE score and 3MSE slope, using models that accommodate deaths in 
different ways. Even a predicted value for the response, such as "3MSE at age 75, " represents a different estimand for each 

model 



Sample research question(s) 



3MSE at age 75 



Linear 3MSE slope (annual change in 3MSE) 



Unconditional f{Yi) 

What is the expected 3MSE at age 75, in an 
immortal cohort? 

Fully conditional: survivors f{Yi\Si > 75) 
What is the expected 3MSE at age 70 or at 
age 75, for people who live to be at least 75? 
Fully conditional: decedents f{Yi\Si < 75) 
What is the expected 3MSE at age 70 for peo- 
ple who die at age 71-75? 



90+74+64+(~10) 
4 

points 



= 54.5 



90+74 
2 



— 82 points 



o+(-2)+(-4)+(-i5) ^ _5 25 points per year 



^'^^^ '^^ = — 1.0 points per year 



(both deceased at age 75) '^^^ '^^^ = —9.5 points per year 



Fully conditional: principal stratification f{Yi{z)\Si{0) > 9,5^(1) > 9) 

What is the causal effect of gender on expected (causal effect not estimated for hypothetical data) 
3MSE scores at the fourth or ninth survey 
among those who would live to the ninth sur- 
vey regardless of gender? 



Fully conditional: terminal decline f(Yi\Si = s) 

What is the expected 3MSE two years before (not estimated directly) 
death? 



—9.5 points per year (same as row b 
decedents — changing time scale does not 
make a difference with only one stratum of 
decedents) 



Partly conditional: regression conditioning on being alive f{Yi\Si >t) 

What is the expected 3MSE at age 70 for peo- 76.2 -I- 0.92*5 = 80.: 
pie who live to be at least 70, or at age 75 for points 
people who live to be at least 75? 



f. Joint model f{Yi,Si) 

What is the probability of being alive and 
healthy at age 75 for people who were alive 
at age 70? 



4 alive and 3MSE > 80 at 

4 — 

age 75 



0.92 points per year 



on average ^ lose health/life each year 
(since | healthy at age 70, and | healthy 
at age 75) 



Figure 1 shows fitted 3MSE values for a baseline 
age of 70 years. Clearly the estimators (functions of 
sample data) are different for the 6 methods shown 
for finding 3MSE fitted values and slopes. However, 
we emphasize that the estimands are also different: 
the linear slope in an unconditional model is not 
the same as the linear slope in a fully conditional 
model. We assume each estimator is unbiased for its 
estimand, and focus on interpretation of the esti- 
mands. 

4.1 f{Yi) Unconditional 

An unconditional model, f(Yi), is appropriate if 
deaths do not occur, are independent of the response 
process, or do not result in truncation (if the re- 
sponse has a well-defined value following death). If 
these stipulations are not met, the unconditional 



distribution /(Yj) reflects averaging /(Yj|5j) over 
the survival function f{Si), as demonstrated below. 
Unconditional regression models cognitive function- 
ing at all timepoints as if nobody died, in an "im- 
mortal cohort" (Dufouil, Brayne and Clayton, 2004). 
The unconditional average 3MSE at age 75 years in 
the CHS hypothetical data is as follows: 

E(Y|age = 75) 

= E(Y|age = 75, 5 > 75) • P(5 > 75) 

+ E(Y|age = 75,S< 75) • P{S < 75) 
_90_+74 2 ^2 
~ 2 A^' ' 4' 

For analysis methods for which "missing at ran- 
dom" nonresponse mechanisms are ignorable — such 
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a) Uncondftionai b) Fully Conditional: Pattern-Mixture 



CO 
■D 

S 



a. 



o _ 




— - mean female 
mean male 

— sample individuals' fitted trajectories 

1 — 



72 



— r- 

74 



— f— 

76 



— I— 

78 



to 

■D 



a. 




age in years 



age in years 



c) Fully Conditional: Principal Stratification 



d) Fully Conditional: Terminal Decline 




year of study 




e) Partly Conditional 



f) Joint Model 




CO 
T3 



CO 

d 



d 



o 
d 




age in years 



age in years 



Fig. 1. Fitted Modified Mini-Mental State Examination (3MSE) trajectories for CHS participants aged 70 years at baseline, 
using different models to summarize longitudinal response (3MSE) and survival. Fitted values over time are shown as solid 
lines for males, and as dashed lines for females. 



as for likelihood-based models fit to unbalanced lon- 
gitudinal data — the value of ? is imputed implic- 
itly because of the structure imposed by correla- 
tion between each subject's longitudinal observa- 
tions (Laird, 1988). We convey this implicit impu- 
tation by projecting the hypothetical 3MSE data 
of deceased participants based on individual slopes. 
This extrapolation is more extreme than estimates 



would be with real data, since estimation of fixed ef- 
fects in random effects models will be influenced by 
regression to the mean and other shrinkage (Robinson, 
1991). Extending the Table 1 trajectories for Partic- 
ipants C and D linearly, the "incomplete" response 
vectors (84,80,76) and (65,50,35) are imputed to 
(84,80,76,72,68,64) and (65,50,35,20,5,-10). Par- 



LONGITUDINAL DATA TRUNCATED BY DEATH 



7 



ticipant D's imputed response at age 75 (—10 points) 
is inappropriate, outside the range of the 3MSE. 

An unconditional model uses both observed and 
implicitly imputed data to estimate linear 3MSE 
slope and 3MSE at age 75 (Table 3, row a). Com- 
pleting the estimate of 3MSE at age 75, 

T./^.i 90 + 74 2 64 + (-10) 2 
E(Y|age = 75) = • - + ■ - 

= 54.5. 

The age 75 fitted 3MSE (54.5 points) and linear 
3MSE slope (5.25 point decline per year) both yield 
lower estimates of cognitive functioning at age 75 
than are observed for participants alive at age 75, 
because 3MSE values are imputed beyond death. 

The unconditional model of 3MSE scores by age 
in the CHS data (Figure la) is a random effects lin- 
ear regression, separating the effects of age and ag- 
ing (Neuhaus and Kalbfleisch, 1998) in a quadratic 
model with random intercept and first-order poly- 
nomial: 

3MSEij = /3o + boi + /3i • male^ + P2 ■ ageO^ 
+ W3 + bu)- year^j + /S^ ■ year^^ 
+ /35 • malej • year^j + /Sg • male^ • year^^ 

+ ^ij ) 

where ageOj is the baseline age of participant i (in 
years) , and year j^- is the study year (years since base- 
line age) for participant i at timepoint j. malej is a 
dichotomous variable (with value if the participant 
is female and value 1 if the participant is male) , and 
random intercept, slope and error (6oi, bu, and Eij) 
are normally distributed with mean 0. Likelihood- 
based methods such as random effects regression will 
fit an unconditional model, since they treat any im- 
balance in the data as "missing at random." Inter- 
actions between sex and linear and quadratic terms 
are included as potential predictors of interest. Fig- 
ure la shows the fitted "average" trajectories for 
males and females (thick lines, 6oi = bu = Eij = 0), 
and several fitted trajectories for individuals (thin 
lines, selected 6oi find bu). The random intercepts 
and slopes allow a wide range of individual fitted tra- 
jectories. However, time trends and other covariate 
effects are generally interpreted based on the mean 
model (thick lines), which reflects both observed 
data and data implicitly imputed beyond death. Ac- 
cording to this average trajectory, the expected 3MSE 
for both males and females is 86 points at age 75, 



and 77 points at age 79. The unconditional model 
suggests rather strong age-associated declines in cog- 
nitive functioning, but the estimand is the trajectory 
for an immortal cohort in which truncation by death 
does not occur. 

In rare cases, implicit imputation beyond death 
may be reasonable. When evaluating local recur- 
rence following ablation of liver tumors, some liv- 
ers may "die" due to transplant. Tumor marker lev- 
els (Yj) and transplant candidacy (Si) are related. 
However, since the transplant rate will never be 100%, 
an unconditional model and implicit imputation be- 
yond transplant may be valid. The research question 
addressed is, "What would the tumor marker levels 
be if no transplants had occurred, but the markers 
had continued along the exact path that led to trans- 
plant?" While this question is relevant to liver trans- 
plants, the CHS examples show that unconditional 
models are generally inappropriate for longitudinal 
data with considerable imbalance due to death. 

4.2 fiYi\Si = s) Fully Conditional 

Sometimes only subjects who survive to the end of 
the study are included in analysis, or decedents are 
analyzed separately from nondecedents. An anal- 
ogy in the missing data literature is pattern-mixture 
models (Little, 1995; Fitzmaurice and Laird, 2000), 
which stratify by the time of dropout. Pattern-mixture 
models may be fit using the same methods as un- 
conditional models (random effects regression, etc.) 
but are made fully conditional by fitting separate 
regression models to strata defined by time of death. 
Generally a categorical variable defined by 
survival time is used as a main effect (and inter- 
action term) in regression models, so that longitudi- 
nal trajectories are fit for groups defined by time of 
death (Ribaudo, Thompson and Allen-Mersh, 2000; 
Pauler, McCoy and Moinpour, 2003). An advantage 
of this approach is accurate representation of in- 
dividuals' scores over time. Principal stratification 
(Frangakis and Rubin, 2002; Rubin, 2006) examines 
causal effects by using counterfactual survival to cre- 
ate strata. Another stratification based on time of 
death examines the "dying process" for decedents 
only, with years until death as the timescale for ex- 
amining terminal decline (Siegler, 1975; Diehr et al., 
2002; Wilson et al, 2003). 

4.2.1 Pattern-mixture. Computing linear slopes 
separately for decedents and survivors in the hy- 
pothetical CHS data (Table 3, row b) demonstrates 
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minimal decline in survivors [1 point per year] and 
terminal decline for decedents [row b(2), 9.5 points 
per year]. However, since the time of death is not 
known in advance, these models could not be used 
to predict an individual's trajectory based on base- 
line information. 

In a pattern-mixture model fit to CHS data, 
quadratic time trends are stratified by year of death 
relative to baseline. Figure lb shows fitted mean 
3MSE trajectories for a baseline age of 70 years. 
The pattern-mixture model demonstrates terminal 
decline in participants who died (fitted lines that 
end before age 79), and reasonably stable cognitive 
functioning in participants who survived. The fit- 
ted 3MSE at age 75 ranged from 75 points in males 
who died by age 76, to 91 points in males and fe- 
males who enrolled at age 70 and survived at least 
to age 79. The fitted mean trajectories are closer to 
trajectories observed for individuals than the uncon- 
ditional model, but require conditioning on survival 
time, which is not known at baseline. 

4.2.2 Principal stratification. Another fully con- 
ditional approach is to estimate causal effects for 
selected principal strata defined by potential 
survival outcomes (Frangakis and Rubin, 2002; 
Hayden, Pauler and Schoenfeld, 2005; Egleston et al., 
2007; Egleston, Scharfstein and MacKenzie, 2009). 
Men are generally more likely to die than women 
at any given age. Do women have lower incidence 
of mental and physical decline, or are they able to 
survive with greater deficits than men? By examin- 
ing only a hypothetical group that would survive to 
a certain timepoint regardless of gender, we can de- 
couple the association of gender and cognitive func- 
tion from the association of gender and survival. 
We estimate the association of gender and cognitive 
function only in the stratum of patients expected to 
live nine years from age 70-75 regardless of gender. 
The interpretation of these fully conditional models 
differs from pattern-mixture models, because prin- 
cipal stratification models are conditioned on both 
observed and counterfactual survival status. Gener- 
ally, untestable assumptions are necessary to iden- 
tify effects within the principal strata. 

Specification of principal stratification models re- 
quires additional notation to describe potential out- 
comes. For z = {0 for women, 1 for men}, let Di{z) 
be an indicator for potential death by a specified 
time and Yi{z) denote potential 3MSE scores. For 
this analysis, Di{z) = 1 if a person dies within 9 



years of study enrollment, and otherwise. 
Let Si{z) similarly denote the potential survival time 
for person i with gender z, and Si{z) determines 
the dimension of Yi{z). We are interested in the 
3MSE scores among those in whom Di(0) = and 
Di{l) = 0. Let Xi represent a vector of potentially 
confounding covariates; we only include age as a con- 
founding covariate. To identify potential outcomes 
at each time point, we make the explainable nonran- 
dom survival assumption and use the estimator of 
Hayden, Pauler and Schoenfeld (2005). Explainable 
nonrandom survival assumes that Di(z)J-Di{l — z)\Xi 
and Di{z)±Yi{l-z)\Xi,Di{l-z)=0 where _L rep- 
resents conditional independence. This assumption 
heuristically states that counterfactual outcomes will 
be conditionally independent of the observed out- 
comes. We need such an assumption since we do not 
observe counterfactual outcomes for study 
participants; for example, we do not observe 3MSE 
scores that women in the study would have had if 
they had been born men. We also make a pseudo- 
randomization assumption that the potential out- 
comes are independent of gender given Xi as de- 
tailed in Egleston et al. (2007). Pseudo-randomization 
basically states that the "assignment" of gender is 
analogous to a random coin toss given a set of con- 
founding covariates. We appreciate that some might 
feel that gender is inappropriate for such a causal 
analysis even as a didactic example, since gender 
cannot be manipulated (Holland, 1986). 

Figure Ic shows fitted 3MSE scores for partici- 
pants aged 70-75 at baseline who would survive 9 
years regardless of gender. Men and women are both 
predicted to have stable scores on average for four 
years after baseline, and to decline an average of 
3.7 points in men and 5.0 points in women over 
the following five-year period. Trajectories for men 
and for women look similar to the survivor cohort 
in the pattern-mixture model (Figure lb). In other 
words, when survival is held comparable for both 
genders, men may show an advantage in cognitive 
functioning. This could reflect an ability for women 
to survive when cognitive functioning is diminished. 
Cognitive impairment may occur at the same rate in 
men and women, but may be associated with greater 
mortality risk in men. 

One concern with this approach is that explain- 
able nonrandom survival is a very strong 
assumption. A number of sensitivity analysis ap- 
proaches are available to investigate whether devia- 
tions from this assumption could influence 
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inferences (Hayden, Pauler and Schoenfeld, 2005; 
Egleston et al., 2007; Egleston, Scharfstein and 
MacKenzie, 2009). Still, some investigators have 
advocated that the use of relatively strong 
assumptions to uniquely identify principal strata ef- 
fects is justifiable (Joffe, Small and Hsu, 2007; 
Elliott, Joffe and Chen, 2006). 

The principal stratification approach is better suited 
to estimating causal effects for interventions than 
to describing the prognosis for individuals. Survival 
status and nonidentifiable assumptions about coun- 
terfactual survival are both part of the estimand for 
causal effects within each principal stratum. 

4.2.3 Terminal decline. A third fully conditional 
CHS model examines terminal decline. Rather than 
counting forward in years of age, the time scale for 
this analysis counts backward from death. The 2458 
participants who are alive at the end of follow-up 
(64%) are excluded, since their age at death is not 
known. As in earlier models, a random effects model 
is fit with quadratic time and random intercept and 
slope. However, the time scale is changed: 

3MSEjj = /3o + boi + /?! • malci + (32 ■ ageOj 

+ (/33 + ^li) • yr Jr.deathjj- 

+ /34 • yr_fr_death^j 

+ /Ss • malcj • yr_fr_deathjj 

+ /Se • malcj • yr_fr_death^j + Sij , 

where yr_fr_death ranges from —1 (one year before 
the year of death) to —9 (9 years before), and other 
variables are as described in Section 4.1. Figure Id 
shows fitted average terminal decline trajectories for 
men and women with baseline age 70. The estimated 
rate of decline is about 4.7 points per year, plus 
the effect of a negative quadratic coefficient (—0.3). 
The fitted 3MSE score 6 years before death is about 
87 points, close to the average baseline value (88 
points) for the 70-year-olds at baseline. For this co- 
hort of decedents, averaging over yr_fr_death and 
male, the expected 3MSE at age 75 is 85 points, 
and at age 79 is 82 points. The rate of terminal de- 
cline refiects the combined influence of the most dra- 
matic declines and the more stable 3MSE patterns 
observed in the multiple pattern-mixture fitted tra- 
jectories for decedents. 

4.3 f(Yi\Si > t) Partly Conditional 

For partly conditional models, the expected value 
of Yjj (response of subject i at time tij) conditions 



on the subjects being alive at time tij. This con- 
ditioning may seem trivial: after all, data are not 
collected posthumously. However, as is well docu- 
mented for data missing due to nonresponse (Laird, 
1988), analysis methods that model the correlation 
structure of longitudinal data (such as mixed mod- 
els) will implicitly impute responses, whether miss- 
ing due to dropout or death (Dufouil, Brayne and 
Clayton, 2004; Kurland and Heagerty, 2005). Partly 
conditional regression models may be estimated by 
assuming independence among the longitudinal re- 
sponses. In that sense, they may be fit using lin- 
ear regression or generalized linear models. How- 
ever, generalized estimating equations (with inde- 
pendence working correlation) allow estimation of 
sandwich standard errors. 

Partly conditional "regression conditioning on be- 
ing alive" (Kurland and Heagerty, 2005) (RCA) de- 
scribes 3MSE scores at different ages among the sur- 
viving participants. For the hypothetical CHS data, 
RCA estimates of average 3MSE at age 75 and lin- 
ear 3MSE slope are calculated using linear regres- 
sion (Table 3, row e). Implicit imputation is avoided 
by treating observations from the same person as 
independent. RCA accurately shows that the preva- 
lent cognitive functioning level is slightly higher at 
age 75 (82 points is the average 3MSE for survivors 
A and B, estimated as 80.8 by imposing a single 
linear slope to all observed data), compared to age 
70 (80.8 point average for A-D, estimated as 76.2 
by linear regression). The partly conditional slope 
predicts that average 3MSE increases 0.92 points 
per year, despite that no individuals have increas- 
ing 3MSE scores. Partly conditional regression re- 
fiects 3MSE in the dynamic cohort of survivors, not 
individual subjects change in cognitive functioning. 

The partly conditional RCA model avoids implicit 
imputation of data for deceased subjects, and de- 
scribes longitudinal 3MSE for the dynamic cohort of 
survivors (Dufouil, Brayne and Clayton, 2004; 
Kurland and Heagerty, 2005). The regression equa- 
tion is similar to that for the unconditional and 
pattern-mixture models, but does not include a ran- 
dom intercept or slope: 

3MSEij = /3o + /3i • malcj + j32 ■ ageOj + /Js • year^^ 

+ Pi ■ yeaifj + P5 ■ malej • year^j 

+ (3q ■ malci • yeaxfj + Eij . 

Figure le shows expected 3MSE scores for partici- 
pants who entered the study at age 70, given that 
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they were alive at the time 3MSE was measured. 
The difference in expected 3MSE at different ages 
is smaller than for the unconditional or fully condi- 
tional models. The expected 3MSE score of partic- 
ipants who entered CHS at age 70 is 91 points for 
surviving 75-year-olds, and 87 points for surviving 
79-year-olds. However, this does not imply an aver- 
age decline of 1 3MSE point per year, since the av- 
erage for survivors is not the same as the trajectory 
from following individuals. The partly conditional 
model tracks the prevalent average 3MSE score in 
the survivors at each timepoint. 

4.4 f{Yi, Si) Joint Model 

A joint response encompassing both survival and 
the longitudinal response also may be of interest. 
A patient facing a diagnosis may ask not only the 
chance of 5-year survival but the chance of 5-year 
survival with acceptable quality of life. A joint model 
of the probability of being alive and healthy 
(Diehr et al., 1995) characterizes the status of the 
entire cohort with respect to a longitudinal response 
and death. A related approach rescales response mea- 
sures to predict the probability of being alive and 
healthy in a prescribed amount of time, such as 
one year (Diehr et al., 2001a). Because the joint re- 
sponse (alive and healthy) is defined at all time- 
points for all individuals, longitudinal data are bal- 
anced. Therefore, analysis methods (random 
effects, GEE, etc.) will not be affected by differential 
survival. Joint models also may assess 
treatment effects simultaneously for longitudinal re- 
sponse and survival (Gray and Brookmeyer, 2000; 
Ratcliffe, Guo and Ten Have, 2004), or integrate mor- 
bidity and mortality in utility measures such as 
quality- adjusted life years. 

Defining 3MSE scores > 80 points as healthy, the 
probability of being alive and healthy at age 75 in 
the hypothetical CHS data is 1/4 (Table 3, row f), 
which reflects a decline in the cohort, since 3/4 were 
alive and healthy at baseline (see Table 1). Assum- 
ing a linear trend, the decline from 3/4 healthy to 
1/4 healthy over 5 years reflects a rate of decline of 
1/10 of the cohort losing health or life each year. 

Figure If shows the proportion of CHS partici- 
pants who are alive and healthy at ages 70-79. The 
percent alive and healthy (PAH) shown is a pro- 
portion reflecting the status of study participants 
enrolled at age 70. Modeling of PAH also is pos- 
sible, as well as constructing confidence intervals 



around the PAH estimate (Johnson, 2002). The per- 
cent alive and healthy is not always decreasing: in- 
dividuals may regain "health," such as when a low 
3MSE score was due to short-term side effects of 
medication. Like the partly conditional model, the 
joint model describes the cohort, rather than trends 
for individuals. Unlike the partly conditional model, 
the entire cohort is described at each timepoint, not 
only survivors. (The end of follow-up for the minor- 
ity recruitment group could affect differences seen 
between ages 70-76 and 77-79. We avoid implicit 
imputation beyond 6 years by computing the em- 
pirical PAH as a simple proportion.) 

For participants aged 70 years at baseline, the 
probability of being alive and having 3MSE > 80 
at age 75 is 0.82 for females and 0.75 for males. 
At age 79, the PAH is 0.70 for females and 0.54 
for males. Summing the area under the curve, the 
average years of healthy life (of 9 possible) is 7.6 
for females, and 6.7 for males (Diehr et al., 1998). 
The average gender difference appears to be greater 
for the joint model than for the other fitted mod- 
els in Figure 1. This refiects a survival advantage 
for females, which was not apparent in models that 
focused on the 3MSE. 

4.5 Other Factorizations 

A factorization of the joint distribution f{Yi,Si) 
not yet discussed is /(5j|Yj)/(Yj). This framework 
is especially applicable to predicting survival (Si) 
using information from longitudinal biomarkers (Yj) 
(De Gruttola and Tu, 1994; Wulfsohn and Tsiatis, 
1997; Ye, Lin and Taylor, 2008). Applying this model 
to hypothetical CHS data, we could conclude that 
33% (1/3) of participants with declining 3MSE sur- 
vive to age 75, while 100% (1/1) with stable 3MSE 
survive to age 75. This class of models would be 
categorized as unconditional in the framework dis- 
cussed here, but is not considered in detail because 
survival, not longitudinal data, is the primary re- 
sponse of interest. 

5. DISCUSSION 

Through analysis of hypothetical and actual data 
sets, we have shown that choice of analysis has a 
great influence on interpretation of longitudinal data 
truncated by death. No single approach is appro- 
priate in all situations, so the analysis should be 
chosen to address the aims of a research project. 
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Summaries of individual trajectories and descrip- 
tions of terminal decline are achieved with fully con- 
ditional models, in which analysis of longitudinal re- 
sponse is stratified by time of death. The principal 
stratification approach is most suited for estimat- 
ing meaningful exposure or treatment effects. For 
example, in an investigation of the effect of trauma 
centers on functional outcomes (Egleston, Scharf- 
stein and MacKenzie, 2009), principal stratification 
could adjust for healthy survivor bias in nontrauma 
centers. The partly conditional model can address 
situations where prevalence, rather than individual 
trajectories, are of interest. For example, survival in- 
formation could estimate the number of new Medi- 
care recipients who will be alive in 10 years, and 
a partly conditional model could then estimate the 
need for dementia services in those survivors. For 
studies of palliative care, treatment effects may be 
described by a joint model for longitudinal response 
and survival. The area under the joint density curve 
would summarize treatment differences in both sur- 
vival and quality of life response. 

Once the statistical model is clarified by research 
aims, choice of analysis method should be apparent 
(Table 2). An unconditional model is fit by random 
effects and other multilevel approaches, when the 
time scale does not depend on survival times. A fully 
conditional model may be fit by random effects or 
other analysis methods, with time scale or stratifica- 
tion depending on survival time (Diehr et al., 2002; 
Pauler, McCoy and Moinpour, 2003). Partly condi- 
tional models may be fit directly by GEE with inde- 
pendence working correlation (Kurland and Heagerty, 
2005). Joint models may be fit as a multivariate joint 
distribution (Gray and Brookmeyer, 2000), or as a 
composite response incorporating both survival and 
longitudinal response (Diehr et al., 2001a). 

We have focused on research in which estimation 
of a parameter, such as a treatment effect, slope 
or patient trajectory, is of primary interest. Other 
classes of statistical analysis are available to address 
different research questions. Additionally, models fit 
using one factorization of the joint distribution of 
survival and longitudinal response may be trans- 
formed to address aspects of another model. For 
example, a fully conditional model may be marginal- 
ized (Heagerty and Zeger (2000)) to estimate a partly 
conditional estimand, such as the expected 3MSE 
among CHS survivors at age 85 (Fitzmaurice and 
Laird, 2000); a joint model may estimate fully condi- 
tional trajectories (Ribaudo, Thompson and Allen- 
Mersh, 2000). 



We have not addressed situations involving inter- 
val censoring or unknown survival times. While a 
two-stage imputation method (Harel et al. (2007)) 
could address both nonresponse and missing sur- 
vival information, the data analysis method should 
be chosen carefully to avoid implicit imputation and 
to address research aims. 

In longitudinal studies in which some subjects die 
yet another response, such as cognitive functioning 
or quality of life, is of primary interest, careful mod- 
eling is required to identify an analysis method to 
address research aims. When deaths occur at many 
different times along the time frame for which re- 
sponses are measured (i.e., age or time from base- 
line), random effects models (which are uncondi- 
tional with respect to survival) may implicitly im- 
pute data beyond death. Implicit imputation is a 
fundamental strength of random effects models in 
the missing data context, but limits the suitabil- 
ity of these unconditional models in analyzing lon- 
gitudinal data with great imbalance due to death. 
When the time scale describes time from (not un- 
til) death, the model becomes fully conditional. For 
terminal decline models, implicit imputation beyond 
death will not occur when random effects models are 
fit. Analysts concerned about the potential impact 
of implicit imputation may fit a generalized linear 
model or generalized estimating equations with in- 
dependence correlation (which fit partly conditional 
models) and compare fitted parameters to an un- 
conditional model. 
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